1 Introduction

For every set of integers $R = \{x_1, \ldots, x_r\}$, there is a corresponding sumset consisting of all pairwise sums

\[ R + R := \{x_i + x_j : 1 \le i, j \le r\} \tag{1} \]

and a difference set consisting of all pairwise differences

\[ R - R := \{x_i - x_j : 1 \le i, j \le r\}. \tag{2} \]

Various properties of these sets are of interest to us, in particular when $R$ is taken to be a random subset of $\{0, \ldots, n-1\}$ for some positive
integer $n$. Here we introduce some terminology to be used throughout the paper.

Definition 1.1. For every $n \in \mathbb{N}$, let $R_n$ be the random variable denoting a uniformly randomly chosen subset of $\{0, \ldots, n-1\}$, with
$P[i \in R_n] = \frac{1}{2}$ for all $i$ satisfying $0 \le i \le n-1$. Define $R'_n$ and $R''_n$ similarly with the extra conditions that $0 \in R'_n$ and $\{0, n-1\} \subseteq R''_n$.
A subscript of $\infty$ indicates that the set is taken from all nonnegative integers.
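To make the definitions concrete, the following Python sketch builds sumsets and difference sets and samples the three kinds of random sets. The function names and the `mode` argument are illustrative choices, not notation from the paper.

```python
import random

def sumset(R):
    """All pairwise sums x + y with x, y in R (Equation 1)."""
    return {x + y for x in R for y in R}

def diffset(R):
    """All pairwise differences x - y with x, y in R (Equation 2)."""
    return {x - y for x in R for y in R}

def random_subset(n, mode="plain", rng=random):
    """Sample a random subset of {0, ..., n-1}.

    mode="plain":  every element is included with probability 1/2 (R_n)
    mode="prime":  as above, but 0 is forced into the set (R'_n)
    mode="double": as above, but 0 and n-1 are both forced in (R''_n)
    """
    R = {i for i in range(n) if rng.random() < 0.5}
    if mode in ("prime", "double"):
        R.add(0)
    if mode == "double":
        R.add(n - 1)
    return R
```

For example, `sumset({0, 2, 3})` contains every integer from 0 to 6 except 1, while `diffset({0, 2, 3})` is the full interval from -3 to 3.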



2 Probability of n as a sum given that n - 1 is a sum 

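Associated with each integer $k \ge 0$ is the probability of it being found in $R_\infty + R_\infty$. Martin and O'Bryant [2006] derived an explicit form
for this probability:

\[ P[k \in R_\infty + R_\infty] = \begin{cases} 1 - \left(\frac{3}{4}\right)^{(k+1)/2}, & \text{if } k \text{ is odd,} \\ 1 - \frac{1}{2}\left(\frac{3}{4}\right)^{k/2}, & \text{if } k \text{ is even.} \end{cases} \tag{3} \]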
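Equation 3 can be checked exactly for small $k$ by enumeration, since only the elements $0, \ldots, k$ can take part in a sum equal to $k$. The sketch below is one way to do this; the helper names are assumptions made for illustration.

```python
from fractions import Fraction

def prob_k_is_sum(k):
    """Exact P[k in R + R] when each of 0..k lies in R independently with prob 1/2."""
    hits = 0
    for mask in range(1 << (k + 1)):          # bit i of mask set <=> i is in R
        if any(mask >> i & 1 and mask >> (k - i) & 1 for i in range(k // 2 + 1)):
            hits += 1
    return Fraction(hits, 1 << (k + 1))

def closed_form(k):
    """Right-hand side of Equation 3."""
    if k % 2 == 1:
        return 1 - Fraction(3, 4) ** ((k + 1) // 2)
    return 1 - Fraction(1, 2) * Fraction(3, 4) ** (k // 2)

if __name__ == "__main__":
    for k in range(8):
        print(k, prob_k_is_sum(k), closed_form(k))
```

The two columns printed by the loop should agree for every $k$; exact rational arithmetic avoids any rounding doubt.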



In the following theorem, we extend this notion to find the probability of $k$ being a sum given that $k - 1$ is a sum.
Theorem 2.1. For every integer $k \ge 1$, the conditional probability

\[ P[k \in R_\infty + R_\infty \mid k-1 \in R_\infty + R_\infty] = 2 + \frac{F_{k+2} + (-1)^k\, 3^{\lfloor k/2 \rfloor} - 2^{k+1}}{2\left(2^k - 3^{\lfloor k/2 \rfloor}\right)}, \tag{4} \]

where $F_{k+2}$ is the $(k+2)$-th term in the Fibonacci sequence, given explicitly as

\[ F_{k+2} = \frac{(1+\sqrt{5})^{k+2} - (1-\sqrt{5})^{k+2}}{2^{k+2}\sqrt{5}}. \tag{5} \]
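As a sanity check on Theorem 2.1, the conditional probability can be computed exactly for small $k$ by enumerating all subsets of $\{0, \ldots, k\}$ and compared with the closed form $2 + \frac{F_{k+2} + (-1)^k 3^{\lfloor k/2 \rfloor} - 2^{k+1}}{2(2^k - 3^{\lfloor k/2 \rfloor})}$. This is only a sketch: the function names are illustrative, and the Fibonacci convention $F_1 = F_2 = 1$ is assumed.

```python
from fractions import Fraction

def fib(n):
    """n-th Fibonacci number with F_1 = F_2 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

def is_sum(mask, s):
    """True if s = x + y for some x, y in the set encoded by the bits of mask."""
    return any(mask >> i & 1 and mask >> (s - i) & 1 for i in range(s // 2 + 1))

def conditional_exact(k):
    """P[k in R+R | k-1 in R+R], by enumerating all subsets of {0, ..., k}."""
    both = prev = 0
    for mask in range(1 << (k + 1)):
        if is_sum(mask, k - 1):
            prev += 1
            both += is_sum(mask, k)
    return Fraction(both, prev)

def conditional_formula(k):
    """Closed form stated in Theorem 2.1."""
    numerator = fib(k + 2) + (-1) ** k * 3 ** (k // 2) - 2 ** (k + 1)
    return 2 + Fraction(numerator, 2 * (2 ** k - 3 ** (k // 2)))
```

For instance, both functions give 1/2 at $k = 1$ and 1 at $k = 2$: if 1 is a sum then 0 and 1 are both in the set, so $2 = 1 + 1$ is a sum as well.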



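Proof. As the actual derivation is rather tedious, I will spare the details and simply outline the proof. By definition of conditional probability,

\[ P[k \in R_\infty + R_\infty \mid k-1 \in R_\infty + R_\infty] = \frac{P[\{k-1, k\} \subseteq R_\infty + R_\infty]}{P[k-1 \in R_\infty + R_\infty]}, \tag{6} \]

for which the denominator is easily determined. As for the numerator, let us introduce some new notation. For $i$ satisfying $0 \le i \le \frac{k}{2}$, let
$E_i^k$ be the event that

\[ \{i, k-i\} \subseteq R_\infty \;\vee\; \{i+1, k-(i+1)\} \subseteq R_\infty \;\vee\; \cdots \;\vee\; \{\lfloor \tfrac{k}{2} \rfloor, k - \lfloor \tfrac{k}{2} \rfloor\} \subseteq R_\infty, \tag{7} \]

and for $i$ satisfying $0 \le i \le \frac{k-1}{2}$, let $E_i^{k-1}$ be the event that

\[ \{i, (k-1)-i\} \subseteq R_\infty \;\vee\; \{i+1, (k-1)-(i+1)\} \subseteq R_\infty \;\vee\; \cdots \;\vee\; \{\lfloor \tfrac{k-1}{2} \rfloor, (k-1) - \lfloor \tfrac{k-1}{2} \rfloor\} \subseteq R_\infty. \tag{8} \]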



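Hence the numerator in Equation 6 may be re-expressed as $P[E_0^{k-1} \wedge E_0^k]$. It turns out that, with proper conditioning of terms, we can write
down a recursive formula for probabilities of this form:

\[ P[E_i^{k-1} \wedge E_i^k] = \tfrac{1}{2} P[E_i^{k-1} \wedge E_{i+1}^k] + \tfrac{1}{4} P[E_{i+1}^{k-1} \wedge E_{i+1}^k] + \cdots \tag{9} \]
\[ P[E_i^{k-1} \wedge E_{i+1}^k] = \tfrac{1}{2} P[E_{i+1}^{k-1} \wedge E_{i+1}^k] + \tfrac{1}{4} P[E_{i+1}^{k-1} \wedge E_{i+2}^k] + \cdots. \tag{10} \]

Now if we denote $P[E_i^{k-1} \wedge E_i^k]$ and $P[E_i^{k-1} \wedge E_{i+1}^k]$ respectively by $P_{2i}(k)$ and $P_{2i+1}(k)$, we have the recursive sequence $\{P_i(k)\}_{0 \le i \le k-1}$
with $P_{k-1}(k) = P_{k-2}(k) = \frac{1}{4}$ and

\[ P_i(k) = \tfrac{1}{2} P_{i+1}(k) + \tfrac{1}{4} P_{i+2}(k) - \tfrac{1}{24}\left(3 + (-1)^{k-i}\right)\left(\tfrac{3}{4}\right)^{\lfloor (k-i)/2 \rfloor} + \tfrac{1}{4} \quad \text{for } i = 0, 1, \ldots, k-3. \tag{11} \]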

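Noticing that the recursion depends on $k - i$ rather than on $k$ and $i$ independently prompts us to introduce yet another sequence $T_j :=
P_{k-j}(k) = P_{k-j-1}(k-1) = \cdots = P_0(j)$ with $T_1 = T_2 = \frac{1}{4}$. For $k \ge 3$, the recursive relation works out to be

\[ T_k = 1 + \frac{F_{k+2}}{2^{k+1}} - \frac{3^{\lfloor k/2 \rfloor}\left(4 - (-1)^k\right)}{2^{k+1}}, \tag{12} \]

where $\{F_i\}$ is the Fibonacci sequence. This formula yields the desired probability since

\begin{align}
P[k \in R_\infty + R_\infty \mid k-1 \in R_\infty + R_\infty] &= \frac{P[\{k-1, k\} \subseteq R_\infty + R_\infty]}{P[k-1 \in R_\infty + R_\infty]} \tag{13} \\
&= \frac{T_k}{1 - \frac{1}{4}\left(3 + (-1)^k\right)\left(\frac{3}{4}\right)^{\lfloor k/2 \rfloor}}. \tag{14}
\end{align}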

□ 



3 Relating several distributions for number of missing sums 

Knowing the probability $P[k \in R_\infty + R_\infty \mid k-1 \in R_\infty + R_\infty]$ is only the tip of the iceberg. What we are really aiming to find out is, for
any $n > 0$, how $R_n + R_n$ varies in size. Certainly, $\#\{R_n + R_n\}$ varies depending on $n$, so instead we turn our focus to the number of sums
it misses in the range of all possible sums, $\{0, \ldots, 2n-2\}$. For convenience, we will introduce some new notation. For any integer set $R$
and real interval $[a, b]$, define $f^+_{[a,b]}(R)$ as the number of sums $R + R$ misses in the interval $[a, b]$. Similarly, let $f^-_{[a,b]}(R)$ be the number of
differences $R - R$ misses in $[a, b]$. More precisely,

\[ f^+_{[a,b]}(R) := \#\{k \in \mathbb{Z} \cap [a,b] : k \notin R + R\}, \tag{15} \]
\[ f^-_{[a,b]}(R) := \#\{k \in \mathbb{Z} \cap [a,b] : k \notin R - R\}. \tag{16} \]
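The two counting functions translate directly into code. The following is a minimal sketch; the names `missing_sums` and `missing_diffs` are illustrative stand-ins for $f^+_{[a,b]}$ and $f^-_{[a,b]}$.

```python
def missing_sums(R, a, b):
    """Number of integers k in [a, b] that are not of the form x + y with x, y in R."""
    sums = {x + y for x in R for y in R}
    return sum(1 for k in range(a, b + 1) if k not in sums)

def missing_diffs(R, a, b):
    """Number of integers k in [a, b] that are not of the form x - y with x, y in R."""
    diffs = {x - y for x in R for y in R}
    return sum(1 for k in range(a, b + 1) if k not in diffs)
```

For $R = \{0, 1, 4\}$ viewed as a subset of $\{0, \ldots, 4\}$, the sumset is $\{0, 1, 2, 4, 5, 8\}$, so three sums (3, 6 and 7) are missed in $[0, 8]$ and one in $[0, 4]$. The only nonnegative difference missed in $[0, 4]$ is 2, and counting $-2$ as well gives two missed differences in $[-4, 4]$.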

For the three special sets $R_n$, $R'_n$ and $R''_n$ defined earlier, we are primarily concerned with the sums they miss in the intervals $[0, n-1]$ and
$[0, 2n-2]$, along with the differences missed in $[0, n-1]$ and $[-(n-1), n-1]$. Note that the differences missed in $[-(n-1), n-1]$ are
exactly those missed in $[0, n-1]$, plus their negative counterparts.

Martin and O'Bryant [2006] studied several of these probabilities, including $P[f^+_{[0,2n-2]}(R_n) = m]$, the probability of $R_n + R_n$ missing
exactly $m$ sums in $[0, 2n-2]$. From randomly generated data, they speculated that underneath this distribution lies a more fundamental
distribution involving $P[f^+_{[0,n-1]}(R_n) = m]$, which in turn is built upon the distribution for $P[f^+_{[0,n-1]}(R'_n) = m]$ (Figure 1). The following
theorem states the exact relationship between these distributions in the limiting case as $n$ approaches infinity:

Theorem 3.1. Let $m$ be a nonnegative integer. Then we have

(a) \[ \lim_{n \to \infty} P[f^+_{[0,2n-2]}(R_n) = m] = \sum_{i=0}^{m} \lim_{n \to \infty} P[f^+_{[0,n-1]}(R_n) = i] \, \lim_{n \to \infty} P[f^+_{[0,n-1]}(R_n) = m - i], \]

(b) \[ \lim_{n \to \infty} P[f^+_{[0,n-1]}(R_n) = m] = \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} \lim_{n \to \infty} P[f^+_{[0,n-1]}(R'_n) = m - 2i]. \]



The proof involves counting the missing sums separately, for example, splitting them into small sums (ranging from 0 to $n-1$) and large
sums (ranging from $n$ to $2n-2$). For written simplicity, we employ the following subscripts to denote the intervals involved (the possible
parameters $R_n$ and $R'_n$ are omitted here):

(a) $f_{All} := f_{[0,2n-2]}$
(b) $f_S := f_{[0,n-1]}$
(c) $f_L := f_{[n,2n-2]}$
(d) $f_{XS} := f_{[0,\lfloor n/8 \rfloor - 1]}$
(e) $f_{MS} := f_{[\lfloor n/8 \rfloor, n-1]}$
(f) $f_{ML} := f_{[n,(2n-2) - \lfloor n/8 \rfloor]}$
(g) $f_{XL} := f_{[(2n-2) - \lfloor n/8 \rfloor + 1, 2n-2]}$
(h) $f_M := f_{[\lfloor n/8 \rfloor, (2n-2) - \lfloor n/8 \rfloor]}$

Figure 1: The distributions for the number of missing sums for (a) $R_n$ in $[0, 2n-2]$, (b) $R_n$ in $[0, n-1]$, and (c) $R'_n$ in $[0, n-1]$.



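Here are a few lemmas to facilitate the proof of the theorem:
Lemma 3.2.

\[ \lim_{n \to \infty} P[f^+_M(R_n) \ge 1] = 0. \tag{17} \]

Proof. From Equation 3 and the symmetry of $R_n + R_n$ under $k \mapsto (2n-2) - k$, we can derive the inequality $P[k \notin R_n + R_n] \le \left(\frac{3}{4}\right)^{\lfloor k^*/2 \rfloor}$, where $k^* := \min(k, (2n-2)-k)$, so

\begin{align}
\lim_{n \to \infty} P[f^+_M(R_n) \ge 1] &\le \lim_{n \to \infty} \sum_{k=\lfloor n/8 \rfloor}^{(2n-2)-\lfloor n/8 \rfloor} P[k \notin R_n + R_n] \tag{18} \\
&\le \lim_{n \to \infty} \sum_{k=\lfloor n/8 \rfloor}^{(2n-2)-\lfloor n/8 \rfloor} \left(\tfrac{3}{4}\right)^{\lfloor k^*/2 \rfloor} \tag{19} \\
&\le \lim_{n \to \infty} \sum_{k=\lfloor n/8 \rfloor}^{(2n-2)-\lfloor n/8 \rfloor} \left(\tfrac{3}{4}\right)^{\lfloor \lfloor n/8 \rfloor / 2 \rfloor} \tag{20} \\
&\le \lim_{n \to \infty} \left(2n - 2\lfloor n/8 \rfloor - 1\right) \left(\tfrac{3}{4}\right)^{\lfloor \lfloor n/8 \rfloor / 2 \rfloor} \tag{21} \\
&= 0, \tag{22}
\end{align}

since the exponential decay of $\left(\frac{3}{4}\right)^{\lfloor \lfloor n/8 \rfloor / 2 \rfloor}$ exceeds the linear growth of $2n - 2\lfloor n/8 \rfloor - 1$. A probability is always nonnegative, hence
$\lim_{n \to \infty} P[f^+_M(R_n) \ge 1]$ must equal zero. □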
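To get a feel for how quickly the bound in the proof of Lemma 3.2 takes effect, one can evaluate the product of the number of terms, $2n - 2\lfloor n/8 \rfloor - 1$, and the common bound $(3/4)^{\lfloor \lfloor n/8 \rfloor / 2 \rfloor}$ numerically. The helper name below is an illustrative assumption.

```python
def lemma_bound(n):
    """Upper bound (number of middle sums) * (3/4)^floor(floor(n/8)/2) on P[f_M >= 1]."""
    terms = 2 * n - 2 * (n // 8) - 1
    return terms * 0.75 ** ((n // 8) // 2)

if __name__ == "__main__":
    for n in (100, 1000, 10000):
        print(n, lemma_bound(n))
```

The bound is vacuous (larger than 1) for small $n$ such as $n = 100$ and only becomes informative once the exponential factor dominates, which is all the limiting argument requires.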



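Lemma 3.3. For any integer $m \ge 0$, the limit of the probability

\[ \lim_{n \to \infty} \sum_{a,c \ge 0,\; b \ge 1,\; a+b+c=m} P[f^+_{XS}(R_n) = a \wedge f^+_M(R_n) = b \wedge f^+_{XL}(R_n) = c] = 0. \tag{23} \]

Proof. Using Lemma 3.2, we have

\begin{align}
\lim_{n \to \infty} \sum_{a,c \ge 0,\; b \ge 1,\; a+b+c=m} P[f^+_{XS}(R_n) = a \wedge f^+_M(R_n) = b \wedge f^+_{XL}(R_n) = c] &\le \lim_{n \to \infty} \sum_{b \ge 1} P[f^+_M(R_n) = b] \tag{24} \\
&\le \lim_{n \to \infty} P[f^+_M(R_n) \ge 1] \tag{25} \\
&= 0. \tag{26}
\end{align}

□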

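Lemma 3.4. For any integers $m \ge 0$ and $0 \le i \le m$, the probability

\[ P[f^+_{XS}(R_n) = i \wedge f^+_{XL}(R_n) = m - i] = P[f^+_{XS}(R_n) = i] \, P[f^+_{XL}(R_n) = m - i]. \tag{27} \]

Proof. To prove the lemma, we need to show that $f^+_{XS}(R_n) = i$ and $f^+_{XL}(R_n) = m - i$ are independent events. By definition,

\begin{align}
f^+_{XS}(R_n) &:= f^+_{[0,\lfloor n/8 \rfloor - 1]}(R_n) \tag{28} \\
&= \lfloor \tfrac{n}{8} \rfloor - \#\left\{ \{0, \ldots, \lfloor \tfrac{n}{8} \rfloor - 1\} \cap (R_n + R_n) \right\} \tag{29} \\
&= \lfloor \tfrac{n}{8} \rfloor - \#\left\{ \{0, \ldots, \lfloor \tfrac{n}{8} \rfloor - 1\} \cap \left[ (\{0, \ldots, \lfloor \tfrac{n}{8} \rfloor - 1\} \cap R_n) + (\{0, \ldots, \lfloor \tfrac{n}{8} \rfloor - 1\} \cap R_n) \right] \right\}, \tag{30}
\end{align}

where the last step uses the fact that any sum of $R_n$ in the range $[0, \lfloor \frac{n}{8} \rfloor - 1]$ can only be the sum of two elements of $R_n$ in that range.
Similarly,

\[ f^+_{XL}(R_n) = \lfloor \tfrac{n}{8} \rfloor - \#\left\{ \{2n - \lfloor \tfrac{n}{8} \rfloor - 1, \ldots, 2n-2\} \cap \left[ (\{n - \lfloor \tfrac{n}{8} \rfloor, \ldots, n-1\} \cap R_n) + (\{n - \lfloor \tfrac{n}{8} \rfloor, \ldots, n-1\} \cap R_n) \right] \right\}, \tag{31} \]

since large sums only result from adding two large elements together: a sum of at least $2n - \lfloor \frac{n}{8} \rfloor - 1$ with both summands at most $n - 1$ forces both summands to be at least $n - \lfloor \frac{n}{8} \rfloor$.

Notice that $f^+_{XS}(R_n)$ is a function of $n$ and $\{0, \ldots, \lfloor \frac{n}{8} \rfloor - 1\} \cap R_n$, while $f^+_{XL}(R_n)$ is a function of $n$ and $\{n - \lfloor \frac{n}{8} \rfloor, \ldots, n-1\} \cap
R_n$. Since the index sets $\{0, \ldots, \lfloor \frac{n}{8} \rfloor - 1\}$ and $\{n - \lfloor \frac{n}{8} \rfloor, \ldots, n-1\}$ are disjoint, for any fixed $n$ the two intersections are independent random sets, and therefore the values of $f^+_{XS}(R_n)$ and $f^+_{XL}(R_n)$
are independent. □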
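The independence claimed in Lemma 3.4 can be confirmed exactly for small $n$ by enumerating every subset of $\{0, \ldots, n-1\}$ and comparing the joint distribution of the two counts with the product of the marginals. This is only an illustrative check under the interval conventions above (extra-small sums in $[0, \lfloor n/8 \rfloor - 1]$, extra-large sums in $[2n - 1 - \lfloor n/8 \rfloor, 2n-2]$); the function name is an assumption.

```python
from collections import Counter
from fractions import Fraction

def check_independence(n):
    """True if the numbers of missing extra-small and extra-large sums are independent."""
    q = n // 8
    joint, small, large = Counter(), Counter(), Counter()
    for mask in range(1 << n):
        R = [i for i in range(n) if mask >> i & 1]
        sums = {x + y for x in R for y in R}
        xs = sum(k not in sums for k in range(0, q))                      # [0, q-1]
        xl = sum(k not in sums for k in range(2 * n - 1 - q, 2 * n - 1))  # [2n-1-q, 2n-2]
        joint[xs, xl] += 1
        small[xs] += 1
        large[xl] += 1
    total = 1 << n
    return all(
        Fraction(joint[a, c], total) == Fraction(small[a], total) * Fraction(large[c], total)
        for a in small for c in large
    )
```

The enumeration visits $2^n$ subsets, so it is only practical for small $n$; for $n$ between 8 and 15 each extreme interval contains a single sum.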

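Lemma 3.5. For any integer $m \ge 0$, the limit of the probability

\[ \lim_{n \to \infty} \sum_{i=0}^{m} P[f^+_{XS}(R_n) = i] \, P[f^+_{XL}(R_n) = m - i] = \lim_{n \to \infty} \sum_{i=0}^{m} P[f^+_S(R_n) = i] \, P[f^+_L(R_n) = m - i]. \tag{32} \]

Proof. To prove the statement, we will show both

\[ \lim_{n \to \infty} P[f^+_{XS}(R_n) = i] = \lim_{n \to \infty} P[f^+_S(R_n) = i], \tag{33} \]
\[ \lim_{n \to \infty} P[f^+_{XL}(R_n) = m - i] = \lim_{n \to \infty} P[f^+_L(R_n) = m - i]. \tag{34} \]

Re-express the probability

\begin{align}
P[f^+_S(R_n) = i] &= P[f^+_{XS}(R_n) = i \wedge f^+_{MS}(R_n) = 0] + P[f^+_{XS}(R_n) = i - f^+_{MS}(R_n) \wedge f^+_{MS}(R_n) \ge 1] \tag{35} \\
&= P[f^+_{XS}(R_n) = i \wedge f^+_{MS}(R_n) = 0] + O\left(P[f^+_{MS}(R_n) \ge 1]\right) \tag{36} \\
&= P[f^+_{XS}(R_n) = i] - P[f^+_{XS}(R_n) = i \wedge f^+_{MS}(R_n) \ge 1] + O\left(P[f^+_{MS}(R_n) \ge 1]\right) \tag{37} \\
&= P[f^+_{XS}(R_n) = i] + O\left(P[f^+_{MS}(R_n) \ge 1]\right) + O\left(P[f^+_{MS}(R_n) \ge 1]\right) \tag{38} \\
&= P[f^+_{XS}(R_n) = i] + O\left(P[f^+_{MS}(R_n) \ge 1]\right) \tag{39} \\
&= P[f^+_{XS}(R_n) = i] + O\left(P[f^+_M(R_n) \ge 1]\right). \tag{40}
\end{align}

Since $\lim_{n \to \infty} P[f^+_M(R_n) \ge 1] = 0$ by Lemma 3.2, we have

\[ \lim_{n \to \infty} P[f^+_S(R_n) = i] = \lim_{n \to \infty} P[f^+_{XS}(R_n) = i]. \tag{41} \]

Analogously, we can show that

\[ \lim_{n \to \infty} P[f^+_L(R_n) = i] = \lim_{n \to \infty} P[f^+_{XL}(R_n) = i]. \tag{42} \]

□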

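Now we are in position to prove Theorem 3.1, starting with Part (a). To avoid redundancy, we will write $f$ in place of $f^+(R_n)$ throughout
this proof, combined with the various pre-defined subscripts. To analyze $P[f_{All} = m]$, first partition $f_{All}$ into $f_{XS}$, $f_M$ and $f_{XL}$, then
condition on whether $f_M = 0$, as shown:

\begin{align}
P[f_{All} = m] &= \sum_{a,b,c \ge 0,\; a+b+c=m} P[f_{XS} = a \wedge f_M = b \wedge f_{XL} = c] \tag{43} \\
&= \sum_{a,c \ge 0,\; a+c=m} P[f_{XS} = a \wedge f_M = 0 \wedge f_{XL} = c] + \sum_{a,c \ge 0,\; b \ge 1,\; a+b+c=m} P[f_{XS} = a \wedge f_M = b \wedge f_{XL} = c] \tag{44} \\
&= \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M = 0 \wedge f_{XL} = m-i] + \sum_{a,c \ge 0,\; b \ge 1,\; a+b+c=m} P[f_{XS} = a \wedge f_M = b \wedge f_{XL} = c]. \tag{45}
\end{align}

Taking the limit as $n$ approaches infinity and then applying Lemma 3.3, we have

\[ \lim_{n \to \infty} P[f_{All} = m] = \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M = 0 \wedge f_{XL} = m-i]. \tag{46} \]

The RHS of the equation can be rewritten as

\[ \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_{XL} = m-i] - \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M \ge 1 \wedge f_{XL} = m-i]. \tag{47} \]

Comparing Equation 47 with its first sum and taking absolute values, we get

\begin{align}
& \left| \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_{XL} = m-i] - \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M = 0 \wedge f_{XL} = m-i] \right| \tag{48} \\
&\qquad \le \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M \ge 1 \wedge f_{XL} = m-i] \tag{49} \\
&\qquad \le \lim_{n \to \infty} P[f_M \ge 1] = 0, \text{ by Lemma 3.2.} \tag{50}
\end{align}

So the expression in Equation 48 equals zero. In other words,

\[ \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_M = 0 \wedge f_{XL} = m-i] = \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_{XL} = m-i], \tag{51} \]

which implies $\lim_{n \to \infty} P[f_{All} = m] = \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i \wedge f_{XL} = m-i]$. Then by Lemmas 3.4 and 3.5,

\begin{align}
\lim_{n \to \infty} P[f_{All} = m] &= \lim_{n \to \infty} \sum_{i=0}^{m} P[f_{XS} = i] \, P[f_{XL} = m-i] \tag{52} \\
&= \lim_{n \to \infty} \sum_{i=0}^{m} P[f_S = i] \, P[f_L = m-i] \tag{53} \\
&= \lim_{n \to \infty} \sum_{i=0}^{m} P[f_S = i] \, P[f_S = m-i], \text{ by symmetry of } f_S \text{ and } f_L, \text{ as desired.} \tag{54}
\end{align}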



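Now for Part (b), begin by conditioning $P[f^+_S(R_n) = m]$ on whether the sumset $R_n + R_n$ is empty, which yields

\[ P[R_n + R_n = \emptyset] \, P[f^+_S(R_n) = m \mid R_n + R_n = \emptyset] + P[R_n + R_n \ne \emptyset] \, P[f^+_S(R_n) = m \mid R_n + R_n \ne \emptyset]. \tag{55} \]

The first term in Equation 55 equals $P[R_n = \emptyset] \, P[f^+_S(R_n) = m \mid f^+_S(R_n) = n] = \left(\frac{1}{2}\right)^n \cdot 1_{[m=n]}$. Hence

\[ \lim_{n \to \infty} P[R_n + R_n = \emptyset] \, P[f^+_S(R_n) = m \mid R_n + R_n = \emptyset] = 0. \]

Then condition the second term in Equation 55 on the value of the minimum sum, $\min(R_n + R_n)$:

\begin{align*}
& P[R_n + R_n \ne \emptyset] \, P[f^+_{[0,n-1]}(R_n) = m \mid R_n + R_n \ne \emptyset] \\
&= \sum_{i=0}^{2n-2} P[\min(R_n + R_n) = i] \, P[f^+_{[0,n-1]}(R_n) = m \mid \min(R_n + R_n) = i] \\
&= \sum_{i=0}^{m} P[\min(R_n + R_n) = i] \, P[f^+_{[0,n-1]}(R_n) = m \mid \min(R_n + R_n) = i] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} P[\min(R_n) = i] \, P[f^+_{[0,n-1]}(R_n) = m \mid \min(R_n) = i] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-1]}(R_n) = m \mid \min(R_n) = i] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-1]}(R'_{n-i} + i) = m] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[2i,n-1]}(R'_{n-i} + i) = m - 2i] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-2i-1]}(R'_{n-i}) = m - 2i].
\end{align*}

Here the minimum sum is twice the minimum element, the $2i$ sums below $2i$ are all missed, and $R_n$ conditioned on $\min(R_n) = i$ is distributed as $R'_{n-i} + i$. Express this result as

\begin{align*}
& \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-i-1]}(R'_{n-i}) - f^+_{[n-2i,n-i-1]}(R'_{n-i}) = m - 2i] \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-i-1]}(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) = 0] \\
&\quad + \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-i-1]}(R'_{n-i}) - f^+_{[n-2i,n-i-1]}(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1]. \tag{68}
\end{align*}

In Equation 68, the limit of the second term satisfies

\begin{align*}
& \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-i-1]}(R'_{n-i}) - f^+_{[n-2i,n-i-1]}(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1] \\
&\le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1] \\
&\le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} P[f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1], \text{ and for } n \gg m, \\
&\le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} P[f^+_M(R'_{n-i}) \ge 1] = 0, \text{ by Lemma 3.2}
\end{align*}

(conditioning on $0$ being in the set at most doubles the probability bounded in Lemma 3.2). Hence $\lim_{n \to \infty} P[f^+_S(R_n) = m]$
expands into three terms, two of which converge to 0 as $n \to \infty$, leaving us

\begin{align}
\lim_{n \to \infty} P[f^+_S(R_n) = m] &= \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[0,n-i-1]}(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) = 0] \tag{73} \\
&= \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_S(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) = 0]. \tag{74}
\end{align}

By the above equation, the difference

\begin{align}
& \left| \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_S(R'_{n-i}) = m - 2i] - \lim_{n \to \infty} P[f^+_S(R_n) = m] \right| \tag{75} \\
&\qquad \le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_S(R'_{n-i}) = m - 2i \wedge f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1] \tag{76} \\
&\qquad \le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1] \tag{77} \\
&\qquad \le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} P[f^+_{[n-2i,n-i-1]}(R'_{n-i}) \ge 1], \text{ and for } n \gg m, \tag{78} \\
&\qquad \le \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} P[f^+_M(R'_{n-i}) \ge 1] \tag{79} \\
&\qquad = 0, \text{ by Lemma 3.2.} \tag{80}
\end{align}

Hence,

\begin{align}
\lim_{n \to \infty} P[f^+_S(R_n) = m] &= \lim_{n \to \infty} \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} P[f^+_S(R'_{n-i}) = m - 2i] \tag{81} \\
&= \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} \lim_{n \to \infty} P[f^+_S(R'_n) = m - 2i], \text{ as required.} \tag{82}
\end{align}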
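The conditioning step in Part (b) has an exact finite-$n$ counterpart that can be verified by enumeration: if $\min(R_n) = i$ with $2i \le n-1$, then exactly $2i$ sums below $2i$ are missed and the rest of the count comes from a shifted copy of $R'_{n-i}$; otherwise every sum in $[0, n-1]$ is missed. The sketch below checks this identity with exact rational arithmetic. The function names and the explicit handling of the boundary cases are assumptions made for illustration, not part of the paper's argument.

```python
from collections import Counter
from fractions import Fraction

def missing_small_sums_dist(n, top, force_zero):
    """Exact distribution of the number of k in [0, top] missing from R + R, where R is a
    uniform random subset of {0, ..., n-1} (with 0 forced into R when force_zero is True)."""
    free = list(range(1, n)) if force_zero else list(range(n))
    dist = Counter()
    for mask in range(1 << len(free)):
        R = [x for j, x in enumerate(free) if mask >> j & 1]
        if force_zero:
            R.append(0)
        sums = {x + y for x in R for y in R}
        dist[sum(k not in sums for k in range(top + 1))] += 1
    total = 1 << len(free)
    return {m: Fraction(c, total) for m, c in dist.items()}

def conditioned_on_minimum(n, m):
    """P[m sums missing in [0, n-1]] computed by conditioning on min(R_n) = i."""
    total = Fraction(0)
    for i in range(n):
        weight = Fraction(1, 2 ** (i + 1))           # P[min(R_n) = i]
        if 2 * i <= n - 1:
            dist = missing_small_sums_dist(n - i, n - 1 - 2 * i, True)
            total += weight * dist.get(m - 2 * i, 0)
        elif m == n:                                  # smallest sum exceeds n-1
            total += weight
    if m == n:
        total += Fraction(1, 2 ** n)                  # R_n is empty
    return total
```

Comparing `missing_small_sums_dist(n, n - 1, False)` with `conditioned_on_minimum(n, m)` for each `m` reproduces the decomposition for any small `n`; the boundary terms with `m == n` are the ones that vanish in the limit.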

In fact, with this method, we can extend Theorem 3.1 as follows (proof omitted): 
Theorem 3.6. Let $m$ be a nonnegative integer. Then we have

(a) \[ \lim_{n \to \infty} P[f_1 = m] = \sum_{i=0}^{m} \lim_{n \to \infty} P[f_2 = i] \, \lim_{n \to \infty} P[f_2 = m - i], \]

Wter. (/l,/2) e {(/[t2„-2](-Rn)./[tn-ll(^«)) ' (/[0,2n-21 (O- /[^n-l] (O) } 

(b) \[ \lim_{n \to \infty} P[f_3 = m] = \sum_{i=0}^{\lfloor m/2 \rfloor} \frac{1}{2^{i+1}} \lim_{n \to \infty} P[f_4 = m - 2i], \]

w/tere (/3,/4) e {(/J,„-l](^n), /[+„_!] (P;)) , (/[t2„-2]('Rn),/[+2„_2](K)) , (/[t2n-2] (O: /[t2n-2] (O) > 
(•^[0,2n-2](-'^")'/[0,2»-2](-^")) ' (/[0,2n-2] (-^n)' /[0,2n-2] (-^")) }■ 

4 Missing fewer sums and differences is more probable?

Now that we know how the distributions of the number of missing sums and differences are related, let us turn our attention to the properties of
each one. The most fundamental distributions for sums and differences are $P[f^+_{[0,n-1]}(R'_n) = m]$ and $P[f^-_{[0,n-1]}(R''_n) = m]$, shown in
Figure 2.

While both appear to be decreasing at a roughly exponential rate, the latter distribution actually has a tiny blip in the tail. We will state this as
a theorem:

Theorem 4.1. For $m \ge 0$, $P[f^-_{[0,n-1]}(R''_n) = m]$ is not decreasing in $m$ except for $n = 1, 2, 3, 5, 9$.

Figure 2: The observed distributions of (a) the number of missing sums for $R'_n$ in $[0, n-1]$ and (b) the number of missing differences for $R''_n$
in $[0, n-1]$.



Proof. Let $p_n(m) = P[f^-_{[0,n-1]}(R''_n) = m]$. For $n \le 10$, we can easily verify the claim by determining the exact function. Figure 3 shows
whether or not these functions are decreasing.

Figure 3: Exact distributions of the number of missing differences for small values of $n$

To prove the theorem for $n \ge 11$, we want to find some $m \in \{1, \ldots, n\}$ such that

\[ p_n(m) - p_n(m-1) > 0. \tag{83} \]

Let us evaluate $p_n(m)$ starting with $m = n, n-1, n-2, n-3, \ldots$.

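• $p_n(n) = P[R''_n - R''_n = \emptyset] = 0$, since $\{0, n-1\} \subseteq R''_n$ implies $\{0, n-1\} \subseteq R''_n - R''_n$

• $p_n(n-1) = 0$, for the same reason

• $p_n(n-2) = P[R''_n = \{0, n-1\}] = \frac{1}{2^{n-2}}$

• $p_n(n-3) = P[R''_n = \{0, \frac{n-1}{2}, n-1\}] = \frac{1}{2^{n-2}} \cdot 1_{[2 \mid n-1]}$

• $p_n(n-4) = P[R''_n = \{0, a, n-1\} \text{ for some } a \ne \frac{n-1}{2}] + P[R''_n = \{0, \frac{1}{3}(n-1), \frac{2}{3}(n-1), n-1\}]$

  $= \frac{n-2}{2^{n-2}} - \frac{1}{2^{n-2}} \cdot 1_{[2 \mid n-1]} + \frac{1}{2^{n-2}} \cdot 1_{[3 \mid n-1]}$

• $p_n(n-5) = P[R''_n = \{0, d, (n-1)-d, n-1\} \text{ for some } d \ne \frac{1}{3}(n-1)]$

  $+ P[R''_n = \{0, \frac{1}{4}(n-1), \frac{1}{2}(n-1), n-1\}]$ (consecutive gaps $d_1 = d_2 = \frac{1}{2} d_3$)

  $+ P[R''_n = \{0, \frac{1}{2}(n-1), \frac{3}{4}(n-1), n-1\}]$ (consecutive gaps $\frac{1}{2} d_1 = d_2 = d_3$)

  $+ P[R''_n = \{0, \frac{1}{4}(n-1), \frac{1}{2}(n-1), \frac{3}{4}(n-1), n-1\}]$

  $= \frac{\lfloor (n-2)/2 \rfloor}{2^{n-2}} - \frac{1}{2^{n-2}} \cdot 1_{[3 \mid n-1]} + \frac{3}{2^{n-2}} \cdot 1_{[4 \mid n-1]}$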



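So in the case of $m = n - 4$,

\begin{align}
p_n(m) - p_n(m-1) &= p_n(n-4) - p_n(n-5) \tag{84} \\
&= \frac{n-2}{2^{n-2}} - \frac{1}{2^{n-2}} 1_{[2 \mid n-1]} + \frac{1}{2^{n-2}} 1_{[3 \mid n-1]} - \frac{\lfloor (n-2)/2 \rfloor}{2^{n-2}} + \frac{1}{2^{n-2}} 1_{[3 \mid n-1]} - \frac{3}{2^{n-2}} 1_{[4 \mid n-1]} \tag{85} \\
&\ge \frac{n-2}{2^{n-2}} - \frac{1}{2^{n-2}} - \frac{(n-2)/2}{2^{n-2}} - \frac{3}{2^{n-2}} \tag{86} \\
&= \frac{n - 5 - \frac{n}{2}}{2^{n-2}} \tag{87} \\
&= \frac{n - 10}{2^{n-1}} > 0. \tag{88}
\end{align}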

□ 
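The tail of the distribution can also be inspected by brute force. The sketch below enumerates every set containing both endpoints $0$ and $n-1$, counts how many differences in $[0, n-1]$ are missing, and compares the two tail entries used in the proof with closed-form counts. Dividing a count by $2^{n-2}$ gives the corresponding probability. The function names are illustrative, and the closed forms in `tail_counts` are the numerators $(n-2) - 1_{[2 \mid n-1]} + 1_{[3 \mid n-1]}$ and $\lfloor (n-2)/2 \rfloor - 1_{[3 \mid n-1]} + 3 \cdot 1_{[4 \mid n-1]}$, taken here as assumptions to be tested rather than as given.

```python
from collections import Counter

def missing_diff_counts(n):
    """counts[m] = number of sets R with {0, n-1} in R, R within {0..n-1},
    whose difference set misses exactly m values in [0, n-1]."""
    counts = Counter()
    inner = list(range(1, n - 1))                 # the freely chosen elements
    for mask in range(1 << len(inner)):
        R = {0, n - 1} | {x for j, x in enumerate(inner) if mask >> j & 1}
        diffs = {x - y for x in R for y in R if x >= y}
        counts[n - len(diffs)] += 1
    return counts

def tail_counts(n):
    """Closed-form counts of sets missing exactly n-4 and n-5 differences."""
    d2, d3, d4 = ((n - 1) % q == 0 for q in (2, 3, 4))
    return (n - 2) - d2 + d3, (n - 2) // 2 - d3 + 3 * d4

if __name__ == "__main__":
    for n in range(11, 15):
        c = missing_diff_counts(n)
        print(n, (c[n - 4], c[n - 5]), tail_counts(n))
```

For $n = 5$ the counts are 4, 2, 1, 1 for $m = 0, 1, 2, 3$, which is non-increasing, in line with $n = 5$ being listed as an exception; for $n \ge 11$ the entry at $m = n - 4$ exceeds the one at $m = n - 5$, which is the blip used in the proof.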

Using the same idea of analyzing the tail-end distribution, we can extend Theorem 4.1 to include several distributions that seem to be
decreasing after a certain point (usually the maximum) but in fact are not, due to small blips in the tails.

Theorem 4.2. In the following distributions, the function reaches a maximum before exhibiting a decreasing trend. For each distribution, let

$m^*$ be where the maximum is reached; then we have

(a) For $m > m^*$, $P[f^-_{[0,n-1]}(R''_n) = m]$ is not decreasing except for $n = 1, 2, 3, 5, 9$.

(b) For m > m*, P {R'n) = not decreasing except for n = 1,2. 

(c) For m > m*, P (Pn) = mj is not decreasing except for n = 1. 
(dj For m > m*, P 2n-2] i^n) = '"•j is not decreasing except for n = 1,2. 

(e) Form > m*, P 2n-2]i-R'n) — "^j is not decreasing except for n = 1. 

(f) Form > m* , P [/[o 2n-2](^") ~ "^j is not decreasing except for n = 1. 

For large enough values of $n$, these distributions stabilize and reach a limit, as shown in Figure 4.

Basically, the theorem says that, even in the parts of the graphs that exhibit a downward trend, these functions are not strictly decreasing 
due to some blips in the tail-end. Notice that the theorem does not assert anything for the limiting distributions because it is very possible 

for the limiting function to be decreasing, as in the case of $\lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_n) = m]$, for example. Another interesting observation is that the

functions dealing with the number of missing sums in the interval $[0, n-1]$ may be decreasing. We will formally state this as a conjecture:

Figure 4: Non-decreasing distributions corresponding to Theorem 4.2



Conjecture 4.1. For each distribution, let $m^*$ be where the maximum is reached; then we have

(a) For m > m*, P {R'n) = "^-j is decreasing. 

(b) For m > m*, P (R'n) = ti^ is decreasing except for n = 4, 5, 6. 

(c) For m > m* , P j^/jj {Rn) ~ rn^ is decreasing except for n = 1. 



These claims are supported by Figure 5.

Figure 5: Distributions that appear to be decreasing as described in Conjecture 4.1



5 A conjecture about the limiting distribution for differences 

Recall that the distribution of missing differences from Figure 4a and the distribution of missing sums for $R'_\infty$ from Figure 5b are the two fundamental distributions, but
despite our efforts, we still do not know much about either except some decreasing properties from the last section. Presently, we want to find
out more about the height of the bars in the graphs through experimentation in Mathematica.

Let us start with counting the sums. Instead of looking at all of $R'_\infty$, which almost surely spans $[0, \infty)$, we will focus on only the elements
of $R'_\infty$ in the interval $[0, 19]$. By definition, $0 \in R'_\infty$. As for the other elements in $[1, 19]$, we will experiment with all combinations. Since for
each experiment we fix which elements from $[0, 19]$ are in $R'_\infty$, we also know exactly which elements from $[0, 19]$ are in $R'_\infty + R'_\infty$, because
elements greater than 19 do not affect sums between 0 and 19. Counting the missing sums for these sets in the interval $[0, 19]$, we have a
distribution as shown in Figure 6a.
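The experiment described above was carried out in Mathematica; the following Python sketch shows one way the same enumeration could be done. It uses integers as bit sets so that the sumset restricted to $[0, \text{top}]$ is obtained with shifts. The function name and the parameter `top` (19 in the text) are illustrative assumptions.

```python
from collections import Counter

def missing_sum_distribution(top):
    """For every subset R of {0, ..., top} that contains 0, count the sums in
    [0, top] missing from R + R; return a Counter mapping count -> number of sets."""
    counts = Counter()
    window = (1 << (top + 1)) - 1                # bit mask for the interval [0, top]
    for mask in range(1 << top):                 # bit j-1 of mask set <=> j is in R
        bits = (mask << 1) | 1                   # force 0 into the set
        present = 0
        for x in range(top + 1):
            if bits >> x & 1:
                present |= bits << x             # adds the translate x + R to the sumset
        present &= window                        # discard sums larger than top
        counts[top + 1 - bin(present).count("1")] += 1
    return counts

if __name__ == "__main__":
    dist = missing_sum_distribution(12)
    total = sum(dist.values())
    for m in sorted(dist):
        print(m, dist[m] / total)
```

With `top = 19` the loop visits $2^{19}$ sets, which is feasible but takes noticeably longer in pure Python; smaller values of `top` already show the overall shape of the distribution.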



Notice the striking similarity between this graph and Figure 5a. This resemblance signifies that very few sets miss more than 20 sums, and
the sums missed are almost always less than 20. In fact, we can observe this pattern by focusing on an even smaller interval, say $[0, 13]$. Very few
missed sums exceed 13.

Figure 6: Distributions for (a) the number of missing sums in $[0, 19]$, and (b) the number of missing differences in $[0, 10]$

Likewise, we can take a finite set, insist that both endpoints be included, and iterate through all possible combinations. Counting all relevant
pairwise differences, we arrive at a distribution as shown in Figure 6b, which bears an uncanny resemblance to Figure 4a, hence demonstrating
that nearly all missed differences are among the 10 largest possible differences. So if $d_{\max}$ is the largest possible difference, then $\{d_{\max} -
9, \ldots, d_{\max}\}$ would comprise virtually all the missed differences.

Let us turn our attention to the graph concerning missing differences. Recall that Figure 6b is obtained by iterating through all subsets of
$[0, 19]$ containing $\{0, 19\}$ and counting the number of missing differences from $[0, 10]$. Now, for $n = 1, \ldots, 9$, we can do the same by taking
all subsets of $[0, 2n-1]$ containing $\{0, 2n-1\}$ and counting missing differences from $[0, n]$, which gives the distributions shown in
Figure 7.
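A minimal Python sketch of this second experiment follows. It enumerates all subsets of $[0, 2n-1]$ that contain both endpoints and counts the differences missing from a chosen window $[lo, hi]$. The window is left as a parameter because the choice of interval is the one detail of the experiment that a reader may want to vary; the function name and parameters are illustrative assumptions.

```python
from collections import Counter

def missing_diff_distribution(n, lo, hi):
    """Over all subsets of {0, ..., 2n-1} containing 0 and 2n-1, count how many
    differences in [lo, hi] are missing; return a Counter mapping count -> number of sets."""
    size = 2 * n
    counts = Counter()
    for mask in range(1 << (size - 2)):          # bit j set <=> element j+1 is in the set
        R = [0, size - 1] + [j + 1 for j in range(size - 2) if mask >> j & 1]
        diffs = {x - y for x in R for y in R}
        counts[sum(d not in diffs for d in range(lo, hi + 1))] += 1
    return counts

if __name__ == "__main__":
    for n in range(1, 7):
        dist = missing_diff_distribution(n, 0, 2 * n - 1)
        total = sum(dist.values())
        print(n, {m: dist[m] / total for m in sorted(dist)})
```

For $n = 2$ and the full window $[0, 3]$, three of the four admissible sets miss no difference and the set $\{0, 3\}$ misses two.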

These graphs suggest that as $n$ approaches $\infty$, a limiting distribution exists and, in addition, is the same as the one shown in Figure 4a. Let
us state this as a lemma and prove it before proceeding.



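Lemma 5.1.

\[ \lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_n) = m] = \lim_{n \to \infty} P[f^-_{[n,2n-1]}(R''_{2n}) = m]. \]

Proof. We have two known identities. The first,

\[ \lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_n) = m] = \lim_{n \to \infty} P[f^-_{[0,2n-1]}(R''_{2n}) = m], \]

is obtained by substituting $n$ with $2n$, and the second is

\[ \lim_{n \to \infty} P[f^-_{[0,2n-1]}(R''_{2n}) = m] = \sum_{i=0}^{m} \left( \lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_{2n}) = i] \, \lim_{n \to \infty} P[f^-_{[n,2n-1]}(R''_{2n}) = m - i] \right). \]

So to prove the lemma, we need to show that $\lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_{2n}) = 0] = 1$. Taking the probability of its complement, we have

\begin{align*}
\lim_{n \to \infty} P[f^-_{[0,n]}(R''_{2n}) \ge 1] &\le \lim_{n \to \infty} \sum_{k=0}^{n} P[k \notin R''_{2n} - R''_{2n}] \\
&\le \lim_{n \to \infty} \sum_{k=0}^{n} \left(\tfrac{3}{4}\right)^{\lfloor n/2 \rfloor} \\
&= \lim_{n \to \infty} (n+1) \left(\tfrac{3}{4}\right)^{\lfloor n/2 \rfloor} = 0,
\end{align*}

where the second inequality holds because, for each $1 \le k \le n$, the pairs $\{j, j+k\}$ in $\{0, \ldots, 2n-1\}$ include at least $\lfloor n/2 \rfloor$ pairwise disjoint ones, each of which is contained in $R''_{2n}$ independently with probability at least $\frac{1}{4}$ (and $k = 0$ is never missed).

Figure 7: Distribution of the number of missing differences for each value of $n$

Then since $\lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_{2n}) \ge 1] = 0$, we conclude that

\begin{align*}
\lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_n) = m] &= \lim_{n \to \infty} P[f^-_{[0,2n-1]}(R''_{2n}) = m] \\
&= \lim_{n \to \infty} P[f^-_{[n,2n-1]}(R''_{2n}) = m].
\end{align*}

□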



The significance of all this is that, while $P[f^-_{[0,n-1]}(R''_n) = m]$ is, in general, not a decreasing function according to Theorem 4.2(a), the distribution of missing differences for $R''_{2n}$ considered in Lemma 5.1 does seem
to be decreasing. If the decreasing trend continues, then its limiting distribution would also be decreasing. Note
that this does not contradict Theorem 4.2(a), since a sequence of functions does not need to be decreasing for its limit to be decreasing.
In the proof of the theorem, we showed that a small blip at the tail-end of the distribution prevents it from being decreasing. If the blip is
the only anomaly in an otherwise decreasing function, then as $n$ approaches $\infty$, the height of the blip would decrease to 0, resulting in a
decreasing limiting distribution. Figure 7 serves to strengthen this last conjecture:

Conjecture 5.1. For all integers $m \ge 0$, $\lim_{n \to \infty} P[f^-_{[0,n-1]}(R''_n) = m]$ is a decreasing function of $m$.